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I. INTRODUCTION 

Random graphs are entities one encounters in many 
fields of research. Every time one has some objects or 
agents in mutual interaction, one can consider these ob- 
jects or agents as the nodes (vertices) of a graph and 
represent pictorially the existence of the interaction by 
an abstract hnk (edge) connecting two nodes. Hence, 
an epidemic can be regarded as a graph: the nodes are 
the infected people and the links connect those who have 
been infected with their infectors. Likewise, the science 
citation index can be represented by a graph. Of course, 
the world-wide-web is a graph. There are many more 
such examples. Usually the graph structure is fairly ran- 
dom. 

Consider a random graph. Let n denote the degree of a 
node and P„ the corresponding bulk probability distribu- 
tion. When Pn ~ for n ^ 1 it is common to call the 
graphs scale-free. This class of graphs is not described 
by the classical theory of random graphs ||l|, where P„ 
is Poissonian. The subject is not really new, but it be- 
came popular recently, when new data on large graphs - 
in particular on the web network structure j^] - became 
available. 

Scale-free graphs are naturally generated by stochastic 
processes, where the graph size is growing and the ad- 
dition of links is preferential: the probability to attach 
a new link to a node is, roughly speaking, proportional 
to the node degree. Several models of growing networks 
have been proposed jij- The prototype is the model 
worked out in a classic paper by Simon This model 
does not really deal with graphs, but can easily be con- 
verted into a (directed) graph model, as observed in [ pT| . 

In studying random graphs one can adopt two comple- 
mentary approaches: 

- The diachronic approach, where one focuses on the 
time evolution of the graph. The main advantage of this 
approach is that one stays close, in spirit at least, to the 
dynamics operating in Nature. Furthermore, one can dis- 



cuss processes, like aging, which are intimately related to 
the time evolution. To our knowledge this approach is the 
one commonly used in works on scale-free graphs. 

- The synchronic approach, where one considers the 
statistical ensemble of graphs at a fixed large time. In- 
troducing a statistical ensemble enables one to use the 
conceptual tools of statistical mechanics. The dynamics 
of processes producing scale-free networks is presumably 
much more complicated than that of the proposed simple 
models. The resulting randomness is better implemented 
in the synchronic than in the diachronic approach. Gen- 
erating graphs with an a priori given connectivity distri- 
bution seems also easier. This is the approach adopted 
in this work. 





FIG. 1. Three graphs with the same number of identical 
nodes and therefore, in our model, with the same weight. A 
Cayley tree (a) has always a diameter ~ logA^, where A'^ = 
#nodes. A comb-like tree (b) has a diameter ~ N. The 
generic trees, like (c), are devoid of any symmetry and can 
be drawn in a variety of manners, ie have large entropy. Such 
trees made up with nodes of degree one and three have the 
Hausdorff dimension dH = 2 (cf [12]) and therefore their di- 
ameter is typically ~ \/iV. 

Introducing a statistical ensemble of graphs means ipso 
facto that the graphs are weighted. In our model each 
graph is given a weight proportional to the product of 
weights attached to individual nodes and depending on 
node degrees. The latter weights are chosen so as to gen- 
erate the scale-free behavior in the ensemble. Thus, in 
a sense, the graphs - the microstates of the ensemble - 
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are given an " internal energy" . Our general results char- 
acterize, of course, the "typical" graphs only, ie those 
contributing most significantly to the ensemble averages. 
This can be considered as a virtue or as a flaw of the 
approach, depending on the goal one is pursuing. The 
"typicality" of graphs results from the usual interplay of 
energy and entropy (see also Fig. 1). 

A rather general graph ensemble is defined in Sec. II. 
For definiteness, we consider undirected graphs only. We 
use techniques borrowed from field theory, which enable 
one to formulate the problem and to find a number of 
results in a rather elegant fashion. The ensemble is de- 
fined for a given set of input weights. We rapidly focus 
on scale-free connected tree graphs, ie connected graphs 
without cycles. This is the case we have fully under con- 
trol and where many results can be obtained analytically, 
so that a maximum clarity is achieved and our approach 
is exposed the best. We discuss the condition that guar- 
antees that an a priori given connectivity distribution is 
found in the " output" , ie when the weights of graphs and 
their entropy are taken into account. In Sec. Ill we show 
that the ensemble can be in a variety of phases. For each 
of these phases we calculate two important scaling expo- 
nents: the fractal and the spectral dimensions. In Sec. 
IV we discuss what happens when the input weights are 
modified. We show that the scale-free regime is unsta- 
ble. Generically, the ensemble develops two responses: 
either a scale is spontaneously generated or else a "sin- 
gular" node appears, which is connected to almost all 
other nodes (the "one-takes-it-all" scenario) [ p^ . 

A graph model can be used as an event generator in 
Monte Carlo studies of the statistical properties of graphs 
themselves or of the "matter" (eg Ising spins, complex 
spins, etc) living on the nodes iQ. Our discussion will 
also lead to the formulation of a simple computer algo- 
rithm for generating random graphs. Readers more inter- 
ested in this aspect of this work than in the more formal 
discussion to follow can jump directly to Sec. V, where 
our algorithm is formulated, after reading the beginning 
of Sec. II to get acquainted with our notation. In Sec. 
VI we present a sample of curves representing the con- 
nectivity distributions calculated for finite systems using 
our code, compared to the analogous curves generated 
from the growing network recipe of refs. and we 

briefly discuss some finite size effects. 

In Sec. VII we summarize our results and we discuss 
the problems related to the generalization of the present 
approach to models more complicated than the connected 
tree model discussed at length in the previous sections. 
We also list some open questions. In this paper, when 
necessary, we make use of some results, found in a differ- 
ent context, scattered in earlier publications we have co- 
authored p^-p^. We believe, that it is useful to adapt 
these results to the present context, putting them in a 
new perspective and making them accessible to a differ- 
ent community. 



II. MINIFIELD THEORY AND GRAPH 
ENSEMBLES 

Our starting idea is to use a toy field theory in di- 
mensions - we call it a minifield theory - and to identify 
the Feynman diagrams of this theory with the graphs of 
our ensemble. The weights attached to the graphs are 
the amplitudes calculated using the standard Feynman 
rules, the model being, of course, formulated in such a 
manner that these weights are positive. 

Let us define the minifield theory by the following 
formal integral 
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By assumption the real constants Pn are non-negative. 
Hence, strictly speaking, the integral does not exist. 
However, expanding the second term in the exponential 
and performing the integrations over one obtains an 
unambiguously defined series. The terms of this series 
can be represented, as usual in field theory, by Feynman 
diagrams. The convergence of the series is, in general, un- 
certain and requires some consideration. When the series 
converges, its sum, also denoted by Z, can be regarded 
as the partition function of a graph ensemble. 
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FIG. 2. The labeled connected Feynman diagrams repre- 
senting the contributions to the rhs of (^). With each one of 
these graphs is associated the weight 2[p2A]*/3. 

The content of the above paragraph will appear obvi- 
ous to the practitioners of field theory, but may appear 
fortuitous to some readers working on networks, but not 
totally familiar with the former. We have no space to 
explain the point at length. Hopefully, a simple example 
may be of some help. Consider the following contribution 
to Z: 
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We need to calculate the eighth moment of a Gaussian. 
We shall do it in a fancy way. For book-keeping purposes 
we distinguish by a label the four "interaction terms" 
that are being multiplied above: 
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and we represent each label by a point in a plane. We 
observe that every moment of a Gaussian is fully deter- 
mined by its second moment, which in our case equals 
kA. We replace all binomials appearing in the integrand, 
0(i)0(j) kX and we join with a line the points repre- 
senting the labels i and j. The latter results in a specific 
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graph, a Feynman diagram. The moment to be calcu- 
lated is obtained by summing over all possible matchings 
of four pairs of labels. All the connected graphs cor- 
responding to our example are shown in Fig. 2. One 
checks easily that to each of these diagrams correspond 
16 distinct matchings of the same four pairs of labels. 
Thus, the contribution to the rhs of (||) of each of these 
Feynman diagrams is A*[p22!]*/4! . The full result is ob- 
tained by taking into account the disconnected diagrams 
too. In some diagrams a line connects a point to itself 
{i = j), these are the so-called "tadpoles". We shall not 
list all these diagrams here. 

The point is that the Feynman diagrams of the mini- 
field theory are the graphs familiar to people working on 
networks, except that there is a specific weight attached 
to each such a graph. The graphs are not necessarily 
connected, except if one imposes an appropriate extra 
constraint, defining a graph sub-ensemble. 

We assume that pi , which plays the role of an external 
current, is strictly positive. Hence, in our minifield the- 
ory there exist graphs with nodes that are " external" , ie 
of unit degree. 

The weight associated with a Feynman diagram that 
is "non-degenerate" - ie does not have tadpoles and mul- 
tiple connections between nodes - and whose nodes are 
labeled, is 

weight = n (4) 

where N and L denote the total number of nodes and 
links respectively. When tadpoles and/or multiple con- 
nections are present one has to multiply the rhs by the 
usual symmetry factors. In graph theory some authors 
accept the existence of degeneracies in the definition of 
what they mean by a graph. In most texts the degenera- 
cies are excluded. The class of graph models we propose 
is fairly general, but not the most general. Indeed, by 
construction the weight of a graph is given in (|^) by a 
product of weights associated with individual nodes. A 
possible generalization will be proposed later on. 

As is well known from field theory, W = nlnZ gener- 
ates the connected graphs. Only tree graphs survive in 
the "semi-classical" limit k — > 0. From here on we focus 
on the statistical ensemble of connected tree graphs. We 
shall return to more general graphs in the last section. 

In this paper we are interested in models where the ra- 
dius of convergence of the series in the exponent on the 
rhs of (||) is finite. It is not a loss of generality to set 
this radius to unity. It is evident from (^) that this can 
always be achieved by a multiplicative renormalization of 
A and k. 

The partition function of connected tree graphs W^'^'^'^^ 
is given by the stationary value of the action at (j) — $, 
$ being defined implicitly by the saddle-point equation 

<i> = A^ (5) 

n>0 



where we have introduced a shorthand notation i„ = 
npn- One easily checks that 

$ = dW^'""'''/dti (6) 

Hence $ generates tree graphs with one marked exter- 
nal node. Eq. with its nice graphical representation 
p^ , p^ - see Fig. 3 - has a long history and is recurrently 
rediscovered in the literature. 




FIG. 3. The graphical representation of eq. (g). With ev- 
ery node in the figure - the open circles - is associated the 
factor \tn. We use the shorthand notation t„ = np„. 

It is easy to see that $ is singular. Indeed, eq. (||) can 
only be satisfied if A is smaller than some critical value 
Ac = Ac(ii,i2, •■•)■ Equation (j^) can be rewritten as 

A-i=^^($) = ^t„$"-2 

n>0 

and by definition 

A-i - F(<i>o) (8) 

where $ = $o is the point in the interval [0, 1] where 
takes its minimal value. The critical behavior of 
the theory is found from the behavior of F{^) in the 
vicinity of $0. 

One observes that both the rhs of and its second 
derivative are positive for (f> > 0. Moreover, Ac is finite 
(because <i > 0). The concavity property of F{^) im- 
plies that one has, in general, the choice between only 
three possibilities (cf refs. ||l|,0,|2|) : 

(A) $0 < 1 and F'($o) = 0. 

(B) $0 = 1 and F'(l) = 0. 

(C) $0 = 1 and F'{\) < 0. 

These three cases will be discussed in detail later on. 

It is customary to introduce a susceptibility critical ex- 
ponent 7 controlling the most singular part of the second 
derivative of the partition function. Since <& is the first 
derivative of the partition function, we write 

$~(Ac-A)i-^ (9) 

The exponent 7 is finite in all cases of interest for us, as 
will be shown in the following sections, where it will be 
calculated. 

A factor A is associated with every link of a tree graph 
and has the physical significance of the fugacity of links. 
Since L = iV — 1, it can also be regarded as the fugacity 
of nodes. Hence 
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Taking the inverse Laplace transform one obtains for tree 
graphs the canonical partition function 
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iV> 1 



(11) 



Hence, the ensemble of random trees behaves like a stan- 
dard statistical mechanical ensemble. In particular, the 
number of trees with N nodes is exponentially bounded 
in TV and the canonical ensemble has an extensive en- 
tropy. 

We have not yet introduced the constraints associated 
with the absence of a scale. The quantity of experimen- 
tal interest is P„, the bulk connectivity distribution. It 
is Pn which should exhibit a power-like fall at large n. 
The relevant question is how to choose the series of the 
bare couplings p„ in order to get a given scale- free P„. 
The rest of this section is devoted to this problem. For 
tree graphs the solution can be found mapping the model 
of graphs on another one, known in the literature as the 
balls-in-boxes model [M. 

In Sec. 2.7.7 of ref. poll a series solution of eq. (^ is 
given : 



$ = ^ $Ar 



N 



with 



N 



-E- 



iV! 



I ''1 ■■■ ''AT 



(12) 



(13) 



and Vj being positive integers satisfying the constraints 
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Eq. ( p^ ) can be further rewritten as 

N^N ^^{N,2N -1) (15) 

with 

N 

n{N,M) = tn,-tn« SC^n, - M) (16) 

The summation is now on all sets of N positive inte- 
gers. The rhs is the partition function of a system of M 
balls distributed at random and with weights t„ among 
N boxes, empty boxes being forbidden. This model of 
weighted partitions is precisely the balls-in-boxes model 

ill- 

The effective box occupation distribution is t„r2(iV — 
\.M — n)/ft{N,M). Using the integral representation 
of the (discrete) (5-function one easily finds an integral 
representation of ^1{N,M), which can be used to calcu- 
late it by the saddle-point method in the limit iV ^ oo. 



p = M/N = const. The effective box occupation distri- 
bution follows. Of course, the tree graph model corre- 
sponds to p = 2. We shall not repeat this calculation 
here, referring the reader to ref. for details and quot- 
ing the result for p = 2 < {n)t = J2n "^n/ J2n • 
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up to an obvious normalization factor. Here $0 is to 
be found from F'{^o) = 0. This last condition can be 
rewritten as 



(18) 



The Ihs is an increasing function of $o- Consequently 
{n)t = 2 corresponds to $0 = 1 and therefore to the case 
(B) introduced earlier. Hence, for tree graphs and in the 
limit N ~^ 00 



U iff {n)t = 2 



(19) 



This result can be understood intuitively. The shape of 
the two distributions, P„ and i„, differs because of the 
constraint = "-j = 2. But for N —> 00 the con- 

straint is satisfied "for free" when {n)t = 2 and /3 > 2 by 
virtue of Khintchin's law of large numbers. 

The condition ([l9| ) solves our problem: the input 
couplings Pn of the tree-graph model should be set to 
Pn = Pn/n in order to get the scale- free connectivity dis- 
tribution Pn in the bulk. The distribution P„ must, of 
course, satisfy the constraint X„ nP„ — 2, otherwise it 
would not refer to trees [E2| . 



III. GEOMETRY OF SCALE-FREE TREE 
GRAPHS 

In this section we discuss the geometry of graphs be- 
longing to the statistical ensemble of scale-free trees. 
Hence, according to the results of the preceding section, 
we assume that tn ^ for n — > 00 and that {n)t — 2. 
We have already stated in the preceding section that 
this situation corresponds to the case (B): $0 = 1 and 
P'(l) = 0. 

It follows from general arguments that P($) has a 
branch point at $ = 1. The order of the singularity is 
determined by the shape of the tail of tn- For non-integer 
/? one finds (cf |l|) that |2|] 



P($) = polynomial in (1 - $) + (1 - $)'^~i 
+ higher order terms 



(20) 



The concavity of F{^) implies /3 > 2. The linear term in 
the polynomial is absent. 

When 2 < /9 < 3, inverting eq. ( ^l|) one finds the 
singular part of $: 



<i>sing ^ (Ac - A)i/(''-i) 



(21) 
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so that 7 = 2)/(/5- 1). 

When /? > 3 the inversion yields 



smg 



(22) 



Therefore, 7 = 1/2. 

We now consider two scahng exponents which have a 
direct geometrical interpretation in terms of the average 
"dimension" of the ensemble graphs. 

One can define a two-point correlation function C{x, A) 
equal, up to normalization, to the average number of 
pairs of nodes separated by a distance x. Using a graph- 
ical representation analogous to that of Fig. 3 one finds 
0: 



(23) 



The fractal dimension dn of the tree can be defined by 

x-^ log C{x, A) = -const(Ac - A)^/''" for x ^ 00 

(24) 

where it is assumed that | Ac — A |^ 1. Notice, that 
(Ac — A)~^ is a measure of the average size of a graph 
|24|. One finds finally 
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(25) 



Another definition of dimension is obtained by consid- 
ering a diffusion process on the graph; as is well known, 
diffusion can be mapped to a random walk. If we consider 
a walker departing from a certain point at time T = 0, 
then the return probability to the start of the walk is, 
for small T, proportional to T~'^=/^ where ds is the spec- 
tral dimension: the dimension of the graph as seen by a 
random walker mo ving on the graph. 

In was shown in ||l8|,^ that the spectral dimension dg 
is for 7 > given by 



(26) 



1 + 7 



Combining (26) and (Eq) one obtains the nice relation 
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(27) 



For /3 > 3 one finds dn — 2 and dg = 4/3. These are 
the generic values for a tree graph |lj,^^, which would 
be found also if there were only a finite number of non- 
vanishing couplings tn- The typical graph is a fully de- 
veloped tree, with many branching branches. 
For 2 < /3 < 3 one obtains: 



dH ^{f3-l)/{!3-2) 



4 = 2(/3~l)/(2/3-3) 



(28) 



(29) 



As /3 — > 2 the trees become increasingly crumpled, they 
look like a set of linked hedgehogs. As /3 approaches 2 the 
average distance between a pair of nodes starts growing 
very slowly with N. E.g. when /3 = 2.1 this distance is 
^ jSfi/dH — 7Vi/6 xhe increase is not logarithmic, as is 
sometimes claimed, but power-like. Of course, in prac- 
tice this does not make much difference, when the power 
is small. 

The upshot of the above discussion is that a scale-free 



large-degree behavior P„ 



/3 > 2, is associated 



with a variety of distinct graph geometries. It is quite 
remarkable, that at least for the tree graphs of the dis- 
cussed ensemble one can give a complete catalogue of 
expected geometries. 



IV. INSTABILITY OF SCALE-FREE REGIME 

The logical next step of the discussion is to examine 
what happens when the shape of t„ is so distorted that 
the constraint {n)t = 2 is broken. We continue assuming 
that tn ^ at large n. 

Consider first the case {n)t < 2. This condition is 
equivalent to F'{1) < 0. We recover the case (C) of Sec. 
II. Since the linear term in the polynomial on the rhs of 
( pl|) is now present, inverting eq. ( pi|) one gets 



'J'sing - (Ac - A)^-l 



(30) 



with j3 > 2. An explicit calculation using (gj) yields 
dn = 00. We also expect dg = 2. A typical tree graph 
looks like a set of linked hedgehogs. Moreover, as ex- 
plained below, a new feature shows up: a "singular" node 
with a fixed degree of 0{N), a hedgehog even more spiky 
than the others. 

To see this, it is instructive to understand how the con- 
straint sjv = 2 is satisfied. Notice that if we omit from 



Sat a few terms, say nj-^ , 



the resulting sum will 



fluctuate around {n)t. Hence, for N 00 



sn 



N 



(31) 



But since sn = 2, it is clear that some, at least, of 
the contributions to the sum on the Ihs must be of or- 
der 0{N). Since the probability distribution falls like a 
power, it is most probable that there is only one such con- 
tribution. Thus, one expects the appearance of a char- 
acteristic "singular" node with degree n — {2 — {n)t)N. 
Other nodes have degree distribution ~ t„: 



P,,^^ + ls[n-i2-{n),)N] 



(32) 



The normalization factor in front of d is determined by 
the tree condition J^n "--P" — 2. 

The other possible case is {n)t > 2. This condition is 
equivalent to F'{1) > 0, which by virtue of the concav- 
ity of F($) implies that $0 < 1 ■ This is the case (A) 
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of Sec. II. Hence P„ equals i„ times an exponentially 
falling factor. A scale in the node order distribution is 
spontaneously generated. The dramatic fluctuations of 
node degrees are absent. 

Expanding F{^) in the neighborhood of (f> — $0, 
where its derivative is bound to vanish and inverting the 
resulting expansion in order to find the most singular 
part of $ one gets 



smg 



(A,-A)- 



(33) 



Hence 7 = 1/2 and one again obtains the generic values 
dn = 2 and 4 = 4/3. 

The above two scenarios are the only alternatives to 
the scale-free behavior of the discussed statistical ensem- 
ble of tree graphs. 

One can show that the transitions between different 
phases of the model are continuous and that the degree 
of their softness varies when the couplings i„ change. 
There is no phase transition between the generic phase 
(A) and the scale-free regime (B) with /3 > 3. 



V. COMPUTER ALGORITHM 

In the synchronic approach the construction of graphs 
does not mimic any real physical process. The empha- 
sis is on the flexibility and on the efficiency in producing 
a sample of graphs to work with. We use the following 
notation : R{n) = pn/pn-i- 

We propose an algorithm working for a given fixed N 
and L. It rewires links, generating Feynman diagrams of 
the minifield theory, proceeding as follows p^ : 

(a) Pick a random oriented link ij. 

(b) Pick a random target node k . 

(c) Apply the Metropolis test: rewire the link ij to ik 

with probability 

Prob(rewiring) = [uk + l)R{nk + l)/njR{nj) (34) 

when the rhs above is less than unity and with probabil- 
ity equal to unity otherwise. 

These Feynman diagrams are properly weighted. The 
algorithm is probing all possible connections within a set 
of distinguishable nodes. For non-degenerate diagrams 
the Metropolis condition immediately follows from the 
detailed balance equations together with the weight (Q) 
given to microstates. For general diagrams the symmetry 
factors, related to the presence of tadpoles and multiple 
connection between nodes, also come out correctly. This 
is again a consequence of the detailed balance : if two 
nodes are connected by m links, the transition to a state 
where the nodes are connected by m — 1 links can be 
done in m ways. The inverse operation has only one re- 
alization. When pn = 1/nl the Metropolis test is always 



positive and the non-degenerate labeled diagrams pro- 
duced by the algorithm are equiprobable, as they should 
be. We have checked the validity of the general argument 
on a variety of examples. 

In order to produce connected tree graphs, one starts 
from a connected tree configuration, eg a polyline with 
free ends and one adds, before the Metropolis test, a 
check insuring that k ^ i, j and rii = 1. It is easy to con- 
vince oneself that with this constraint the tree is never 
broken into parts. As explained in Sec. II, the scale-free 
connectivity distribution in the bulk is obtained setting 
Pn ~ Pn/n. 

Notice, that with a constraint added before the 
Metropolis test, the algorithm explores a well defined 
sub-ensemble of graphs. However, because the detailed 
balance is satisfied, this sub-ensemble is in equilibrium, 
or, more precisely fluctuates around equilibrium (see also 
the remark on that matter in the last section). Thus, 
within the sub-ensemble in question the algorithm tends 
to sample "typical" graphs. 
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FIG. 4. Comparison of the connectivity distribution pro- 
duced by our Monte Carlo algorithm (black) and that of ref. 
[7] (grey) for N = 2" and /3 = 2.1 and 3.0, respectively. 
The points are joined by a line to guide the eye. The almost 
straight lines correspond to the asymptotic P„ given in (35). 



VI. NUMERICAL EXAMPLES AND FINITE-SIZE 
EFFECTS 



Since we propose a new algorithm for generating ran- 
dom graphs, it may be of interest to the reader to see how 
our results compare to those obtained with an algorithm 
already existing on the market. As an example to com- 
pare with we take the Growing Network model of ref. , 
which is a slight generalization of the algorithm proposed 
originally in ref. Q and which also generates connected 
trees. The asymptotic analytic solution for P„ is given 
by eq. (12) in the second paper of ref. § (see also g): 



Pn = (/? - 1) 



r(2/3 - 3)r(n -t- /3 - 3) 
r(/3 - 2)r(n 2/5 - 3) 



(35) 
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FIG. 5. The connectivity distribution calculated for our en- 
semble of tree graphs, at finite N = 2^ , k — 14, 15, 16, 17 and 
for f} = 2.1 and 3.0 . The almost straight lines correspond 
to the asymptotic P„ given in (35). The figure illustrates the 
finite-size cut-off of the order distribution discussed in the 
text. 

We use this P„ as input in our code. A sample result 
is shown in Fig. 4, where we compare the results of the 
calculation using the two algorithms, respectively. The 
calculated points are practically indistinguishable and 
follow the input distribution, except for those n where 
finite-size effects cut and distort the distribution. 

Indeed, when N is finite, the distribution of node de- 
grees cannot extend to infinity, there is an upper cut-off 
on n : n < rimax- The cut-off can be estimated by im- 
posing the condition that the expected number of nodes 
found above rimax is at most equal to unity. Then 

oo 

N^n-'^ = const (36) 

The value of the constant on the rhs depends on the de- 
tailed shape of P„ . Eq. (|36| ) leads to the scaling law 

nm.. ~ N'^<^P-'^ (37) 

analogous to the result of 0,^, where instead of N ap- 
pears the time. Beyond rimax the node distribution func- 
tion falls very abruptly and can be neglected for all prac- 
tical purposes. Notice, that rimax may be much smaller 
than N. As noticed in ref. Q this explains why large 
scale-free graphs with f3 significantly larger than 3 are 
not observed in Nature. 

In Fig. 5 we show the evolution of _P„ with N. In or- 
der to obtain a very precise result we use, instead of the 
Monte Carlo, an analytic recursion formula satisfied by 
the equivalent partition function given by eq. (|l|) of Sec. 
II (see footnote ||2l[] in that section). When the abscissa 
in Fig. 5 is rescaled, for different N and using (|37|), the 
curves corresponding to the same /3 are cut at the same 
place. 



VII. SUMMARY AND DISCUSSION 



A. Connected tree graphs: a tentative summary 



Let us summarize what has been achieved so far: We 
have started by introducing a fairly general ensemble of 
random graphs, but we have rapidly focused our atten- 
tion on the sub-ensemble of connected tree graphs. We 
have found the conditions insuring that the latter are 
scale-free. We have discussed in detail the geometry of 
the connected tree graphs. We have also constructed a 
computer algorithm generating these graphs with an ar- 
bitrary bulk connectivity distribution _P„. Since our an- 
alytic discussion may sound somewhat formal for some 
readers, let us illustrate it with a simple, but very in- 
structive example. 



a 
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8 




crumpled 


3 




y^^^^_^^^-SMle-fr 


o 


generic 





I 2 3 p 4 

FIG. 6. Phase diagram for the simple two parameter model 
presented in Sec. VII: ti — 1, t„ — —n'~'^, n > 1. The part 
of the continuous line on the right of the black dot corresponds 
to the scale-free graphs belonging to the generic phase. 

Consider a two-parameter family of input weights i„ = 
npn'- ^1 = 1 and tn — ^ n^^, n > 1. The condition 
insuring that the trees are scale-free was found in Sec. 
II and is {n)t = X^n^-^^/Sn^" ~ ^- ^^"^ example in 
question it is equivalent to the following relation between 
the parameters a and /3: 

a = 1 + C(/3 - 1) - 2C(/3) (38) 

where CiP) is the Riemann Zeta function. The phase di- 
agram [|l5| is shown in Fig. 6 : the line is calculated from 

As one moves along this line in the direction of decreas- 
ing /?, the geometry remains generic down to /? = 3 and 
becomes progressively more and more crumpled below 
that point. In particular, the fractal dimension increases 
from the initial value 2 to oo (see eq. (p^)). This be- 
havior is easily understood. When /3 is moved towards 2 
the convergence of the series deteriorates. The only way 
to keep the average {n)t constant, viz. equal to 2, is to 
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reduce the normalization of the tail. This means, how- 
ever, that the relative weight of nodes with unit degree 
gets larger. Eventually, the number of "external" nodes 
increases with N much faster than the number of large 
degree nodes. The latter are usually connected to the 
former, building hedgehog-like structures. 

Off the line (^) one falls into one of the alternative 
phases discussed in Sec. IV: either a scale is gener- 
ated spontaneously and P„ falls exponentially, or else 
there appears a "singular" node with huge connectivity 
rising = N{2 — {n)t). In the former case the graphs have a 
typical extension ^ a/ZV while in the latter case they are 
crumpled and have infinite fractal dimension. Because of 
the constant presence of the " singular" vertex the diam- 
eter of the graph grows slower than any power of N for 
N ^ oo. 

Notice, that the condition (3 > 2 appears often in our 
discussion. Indeed, at least in the framework adopted in 
this paper, a scale-free graph with a different exponent 
cannot exist. Actually, the arguments used towards the 
end of Sec. IV can also be used when {n)t = oo and one 
predicts an exponentially falling P„ at the " output" . 

We have considered the stability of scale-free graphs 
with respect to possible deformations of the sequence of 
weights. Indeed, in Nature a random graph does not exist 
in isolation. It is an open system, subject to interactions 
with, loosely speaking, a heat bath. It is thus interesting 
to know what is expected to happen when this interaction 
becomes strong. 

Our results reflect the behavior of "typical" graphs. 
But very "atypical" values of global parameters, for ex- 
ample of the spectral dimension, can be found when 
a specific large graph is analyzed |2^. Our algorithm 
makes it possible to measure the degree of this " atypical- 
ity" : a numerical simulation enables one to estimate the 
probability of a given fluctuation of the global parameter. 



B. More general graph models 

1. Weighted connecttons 

Our graph model has been constructed in such a way 
that the weight of a graph is a product of weights asso- 
ciated with individual nodes. A possible generalization 
could consist in replacing the starting graph generating 
function defined in (pT) by 



1 



exp ■ 



n=l 



(39) 



where A is a positive definite matrix of rank q and 
(j) = (01, ■■.,4'q). The series expansion of the rhs gener- 
ates graphs with nodes of degree 1,2, q and a weight 
oc attached to every link connecting a pair of nodes 
of degree m and n. The limit q ^ oo should be handled 
with care. In a study of tree graphs the limit k ^ 



should be taken first. The investigation of this model 
goes, however, beyond the scope of the present paper 



2. Matter on graphs 

Another generalization would consist in putting " mat- 
ter" fields on graphs. Ising spins on " small- world" graphs 
[p9| were already discussed in ref. |30|. Ising spins liv- 
ing on the nodes of quenched Feynman diagrams gen- 



erated by (j) or (j) interactions were studied by many 
people (see, for example, Q and the long hst of refer- 
ences therein), in particular with the aim to determine 
the critical behavior of the spin system. It would be very 
interesting to extend these studies to scale-free graphs 
as defined in the present paper and to see, for example, 
what is the effect of changing (3 and therefore also the 
fractal dimension of the graphs. 



3. Graphs with cycles 

The most important generalization consists in extend- 
ing the study to graphs with cycles. Introduction of a 
fixed number of cycles does not seem to be a big deal in 
the present context (see, however, ref. [0). For exam- 
ple, when N oo the Hausdorff dimension is expected 
to be insensitive to the presence of a fixed number of cy- 
cles. On the other hand, the case where the ratio L/N is 
kept fixed at some arbitrary value when — > cx) is very 
challenging. 

There are conceptual problems related to the over- 
extensive nature of the ensemble. It is not difficult to 
count the number of non-degenerate graphs with given 
A^ and L. It is sufficient to consider the number of dis- 
tinct adjacency matrices Cy =1 or 0, for i,j connected 
or not, respectively: 



^ non-degenerate graphs(A^, L) 



^N(N-l)\ 

L (40) 



In general the rhs behaves like exp (const A^ log A^) when 
N ^ oo, L/N fixed. A similar result is easily obtained 
for degenerate graphs. The entropy is not extensive. This 
is easy to understand. It is impossible to divide an arbi- 
trary graph into two parts separated by a "boundary" of 
negligible measure. Any two nodes can interact. Hence, 
it is not certain, in general, that the bulk distribution P„ 
calculated in the ensemble coincides with that obtained 
from a single, sufficiently large graph. Tree graphs are a 
notable exception: in the " semiclassical limit" k ^ the 
entropy becomes extensive. 

Our algorithm, in its most general version, generates 
all possible Feynman diagrams and can in principle be 
used to simulate arbitrary graphs. The problem is to 
find the constrains on the couplings pn insuring that the 
bulk connectivity distribution has a desired form. 
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The presence of the factor (n^ + l)/n_,- in ( [3^ ) means 
that the rewired nodes are sampled independently of 
their degree (without this factor nodes belonging to a 
large number of links would be rewired preferentially) . If 
no extra check is performed before the Metropolis one, 
the algorithm does not know about the underlying graph 
structure. It plays with node degrees and nodes as it 
would play with balls and boxes in the balls-in-boxes 
model with occupation weights p„. Hence a scale free 
Pn is obtained setting p„ ~ P„ (and L = ^N{n)p). The 
graphs produced that way are degenerate, there are tad- 
poles and multiple connections between nodes. They are, 
in general, not connected - it is very difficult to avoid pro- 
ducing disconnected graphs - but for /3 < 3 most of nodes 
belong to the giant component. The conditions of its ap- 
pearance are well understood |3^,|l^. The connectivity 
distribution within the giant component has the same 
scale-free behavior at not too small n 

Our present experience is that the algorithm has no 
difficulty to equilibrate the ensemble. The algorithm de- 
fines a Markov process. We do not have any proof that 
the next-to-largest eigenvalue of the transition matrix is 
separated from unity by a gap remaining finite when 
N oo. But it seems very likely that it is so, since 
we do not expect any collective effects to occur. 

A different algorithm, generating graphs with an arbi- 
trary connectivity, is proposed and used in rcfs. Jl9| , ^ . 
In this algorithm a graph is generated in two steps. First, 
a list of N degrees, ni, n2, rtAr, is sampled out of the 
distribution P„. In a sense, one creates first N nodes 
with attached links having the other end free. Then, in 
the second step, one connects together at random the 
free ends of links. The produced graph is, in general, 
not connected and degenerate. The ensemble is defined 
by repeating this procedure over and over. In the limit 
N ^ oo the connectivity distribution is identical to P„ 
for individual graphs. Notice, that L = nj is not 

fixed. 

Our approach, rooted in field theory, is different. We 
start with micro-states weighted so that the connectivity 
distribution becomes scale-free after averaging over the 
ensemble. But, as already mentioned, the scale-free con- 
nectivity within individual graphs is, strictly speaking, 
not guaranteed, except for the ensemble of trees: aver- 
aging within a single graph of infinite size is, in general, 
not equivalent to averaging over the ensemble of graphs. 
This is perhaps a flaw, but on the whole the virtues of 
the two approaches seem complementary. We hope to 
develop this point elsewhere. 

In order to construct non-degenerate graphs it suffices 
to verify, before performing the Metropolis check, that 
the proposed rewiring does not produce tadpoles and/or 
multiple connections. We have a code doing that effi- 
ciently. The real problem, which we have not solved yet, 
is to find for non-degenerate graphs the set of couplings 
Pn producing the desired scale- free bulk distribution P„. 
Setting Pn = Pn gives for, say, (3 = 3 results that are en- 
couraging, although the cut-off rimax is smaller than in 



the degenerate case. However, when f3 is decreased things 
worsen. No doubt, the problem deserves more theoretical 
and numerical work. 

We are tempted to conjecture that the degeneracy does 
not matter for large scale features of graphs. The ab- 
sence of any natural scale suggests that the graphs are 
self-similar. If so, one can decimate nodes as follows: 
pick a node at random and shrink to a single point all its 
immediate neighbors. If the graph is really self-similar, 
decimating nodes should not alter its large scale features. 
But such a decimation will necessarily produce degener- 
ate graphs. For the moment, this conjecture is a specu- 
lation. However, our experience with simulating random 
surfaces suggests that this possibility should not be dis- 
regarded: degenerate randomly triangulated surfaces - 
those whose duals have "tadpoles" and "self-energy" in- 
sertions - turn out to fall into the same universality class 
as the non-degenerate surfaces, but are much easier to 
simulate. 

Finally, let us note that it is straightforward to pro- 
duce a algorithm analogous to the one proposed in this 
paper but constructing directed graphs. 
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